Voice Quality and f0 in Prosody: Towards a Holistic Account
نویسندگان
چکیده
This paper presents a discussion of the role of voice quality in prosody. Illustrations from past production and perception data by the authors indicate that source parameters other than f0 are an inherent part of prosody, implicated in both its linguistic and paralinguistic functions. While prosodic (intonational) analyses of a language tend to be largely presented in terms of f0 dynamics, the argument here is for an integrative approach, where f0 and voice quality – two dimensions of the voice source – are treated together, and are related to the temporal/rhythmic structure of utterances. This should yield a fuller understanding of the nature of prosody and of the underlying production and perceptual correlates of prosodic elements such as pitch accent, declination, focus, phrase boundaries, etc. Such an approach may also serve to bring together the currently fragmented accounts of two core aspects of prosodic functioning: its role in signalling (i) linguistic, contrastive and discourse-related information and (ii) in communicating speaker affect, i.e. mood, emotional state and attitude. While the illustrations presented here provide initial hypotheses, a newly initiated project on Irish prosody will seek to incorporate such a holistic approach to prosodic analysis.
منابع مشابه
F0, voice quality, and Danish stød revisited
Danish stød is a syllable prosody, hitherto described as a kind of creaky voice with tonal side effects. One such is the well established, though not ubiquitous, abrupt lowering of F0 towards the end of the syllable. Eli Fischer-Jørgensen found, in the 1970s and 1980s, a difference also in the beginning of stressed syllables: F0 is higher in the beginning of syllables with stød. Confirming her ...
متن کاملVoice quality as a pitch-range indicator
Pitch perception plays a central role in processing speech prosody. Since f0 varies from speaker to speaker and from context to context, effective pitch-range normalization is thus important to uncover intended linguistic pitch targets. It has also been speculated that voice quality may play a role in pitch-range perception. Our previous study demonstrated that spectral balance indeed effective...
متن کاملIntonation issues in HMM-based speech synthesis for Vietnamese
In an HMM-based Text-To-Speech system, contextual features, including phonetic and prosodic factors have a significant influence to the spectrum, F0 and duration of the synthetic voice. This paper proposes prosodic features aiming at improving the naturalness of an HMM-based TTS system (VTed) for a tonal language, Vietnamese. The ToBI (Tones and Break Indices) features are used to learn two cru...
متن کاملProficiency Assessment of ESL Learner's Sentence Prosody with TTS Synthesized Voice as Reference
We investigate how to assess the prosody quality of an ESL learner’s spoken sentence against native speaker’s natural recording or TTS synthesized voice. A spoken English utterance read by an ESL leaner is compared with the recording of a native speaker, or TTS voice. The corresponding F0 contours (with voicings) and breaks are compared at the mapped syllable level via a DTW. The correlations b...
متن کاملPhysiological Factors Causing Tonal Characteristics of Speech: from Global to Local Prosody
Voice fundamental frequency (F0) determines the tonal quality of vowels, and its rise and fall comprise part of prosody in speech. This seemingly simple linear function results from highly complex physiological factors and thus lacks definitive explanations of the causal mechanisms. This report reviews previous studies and recent discoveries regarding the causal factors of F0 patterns and discu...
متن کامل